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Abstract 

The branching fraction ratio = is mea¬ 

sured using a sample of proton-proton collision data corresponding to 3.0 fb~^ of 
integrated luminosity recorded by the LHCb experiment during 2011 and 2012. The 
tau lepton is identified in the decay mode t~ —)• The semitauonic decay 

is sensitive to contributions from non-Standard-Model particles that preferentially 
couple to the third generation of fermions, in particular Higgs-like charged scalars. 
A multidimensional fit to kinematic distributions of the candidate B^ decays gives 
TZ{D*) = 0.336 ± 0.027 (stat) ± 0.030 (syst). This result, which is the first measure¬ 
ment of this quantity at a hadron collider, is 2.1 standard deviations larger than the 
value expected from lepton universality in the Standard Model. 
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Lepton universality, enshrined within the Standard Model (SM), requires equality of 
couplings between the gauge bosons and the three families of leptons. Hints of lepton 
non-universal effects in —)■ K^e^e~ and —)■ decays Q have been seen, 

but no dehnitive observation of a deviation has yet been made. However, a large class of 
models that extend the SM contain additional interactions involving enhanced couplings 
to the third generation that would violate this principle. Semileptonic decays of b hadrons 
(particles containing a b quark) to third generation leptons provide a sensitive probe for 
such effects. In particular, the presence of additional charged Higgs bosons, which are 
often required in these models, can have a significant effect on the rate of the semitauonic 


decay B^ —)■ D*^T~T'r |^. The use of charge-conjugate modes is implied throughout this 
Letter. 

Semitauonic B meson decays have been observed by the BaBar and Belle col¬ 
laborations (3|-[7|. Recently BaBar reported updated measurements I^IT] of the ra¬ 


tios of branching fractions, 1Z{D*) = D*^t Ur)/B{B^ —z/^) and 

TZ{D) = B{B^ —> D^t~Vt)/B{B^ —)■ which show deviations of 2.7a and 2.0 ct, 


respectively, from the SM predictions These ratios have been calculated to high 

precision, owing to the cancellation of most of the uncertainties associated with the strong 
interaction in the B to D^*'^ transition. Within the SM they differ from unity mainly 
because of phase-space effects due to the differing charged lepton masses. 

This Letter presents the hrst measurement of TZ{D*) in hadron collisions using the 
data recorded by the LHCb detector at the Large Hadron Collider in 2011-2012. The data 
correspond to integrated luminosities of 1.0fb“^ and 2.0fb“^, collected at proton-proton 
{pp) center-of-mass energies of 7 TeV and 8 TeV, respectively. The B^ —)■ decay 

with T~ —)• (the signal channel) and the B^ —)• D *^decay (the normalization 

channel) produce identical visible hnal-state topologies; consequently both are selected 
by a common reconstruction procedure. The selection identihes semileptonic B^ decay 
candidates containing a muon candidate and a D*^ candidate reconstructed through the 
decay chain D*^ —Zl°(—)• . The selected sample contains contributions from 

the signal and the normalization channel, as well as several background processes, which 
include partially reconstructed B decays and candidates from combinations of unrelated 
particles from different b hadron decays. The kinematic and topological properties of 
the various components are exploited to suppress the background contributions. Finally, 
the signal, the normalization component and the residual background are statistically 
disentangled with a multidimensional ht to the data using template distributions derived 
from control samples or from simulation validated against data. 

The LHCb detector [Io| [TI| is a single-arm forward spectrometer covering the 
pseudorapidity range 2 < p < 5, designed for the study of particles containing b or 
c quarks. The detector includes a high-precision tracking system consisting of a silicon- 
strip vertex detector surrounding the pp interaction region 12 , a large-area silicon-strip 


detector located upstream of a dipole magnet with a bending power of about 4Tm, and 
three stations of silicon-strip detectors and straw drift tubes 13 placed downstream of 


the magnet. The tracking system provides a measurement of momentum, p, of charged 
particles with a relative uncertainty that varies from 0.5% at low momentum to 1.0% 
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at 200GeV/c. The minimum distance of a track to a primary vertex (PV), the impact 
parameter, is measured with a resolution of (15 + 29/pt) hiH; where px is the component of 
the momentum transverse to the beam, in GeV/c. Different types of charged hadrons are 
distinguished using information from two ring-imaging Cherenkov detectors (^. Photons, 
electrons and hadrons are identihed by a calorimeter system consisting of scintillating- 
pad and preshower detectors, an electromagnetic calorimeter and a hadronic calorimeter. 
Muons are identihed by a system composed of alternating layers of iron and multiwire 
proportional chambers 0!^. The online event selection is performed by a trigger 16 


which consists of a hardware stage, based on information from the calorimeter and muon 
systems, followed by a software stage, which applies a full event reconstruction. 

A simulation of pp collisions is provided by Pythia 17,18 with a specihc LHCb 

in which 


conhguration 19 


20 


Decays of hadronic particles are described by EvtGen 
hnal-state radiation is generated using Photos pT|. The interaction of the generated 


particles with the detector, and its response, are implemented using the Geant4 toolkit 22 
as described in Ref. 


23 
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The trigger requirements are chosen to avoid the imposition of any px selection on 
the muon, or invariant mass reqnirements on the D*^pT system, crucial for preserving 
the distinct kinematic distributions of the —)■ )■ decay. Events 

are required to pass the hardware trigger either because the decay products of the D*~^ 
candidate satisfy the hadron trigger reqnirements or becanse high-px particles in the 
event, independent of the , satisfy one of the hardware trigger requirements. In the 

software trigger, the events are reqnired to meet criteria designed to accept —)■ K~n^ 

candidates with px > 2 GeV/c. Qnality reqnirements are applied to the tracks of the 
charged particles that originate from a candidate decay: their momenta must exceed 
5GeV/c and at least one must have px > 1.5GeV/c. The momentum vector of the 
candidate must point back to one of the PVs in the event and the reconstrncted mass 
must be consistent with the known mass (^ . 

In the offline reconstruction, the candidates satisfying the trigger are further required 
to have well-identihed K~ and danghters, and the decay vertex is reqnired to be 
signihcantly separated from any PV. The invariant mass of the candidate is required to 
be within 23.5 MeV/c^ of the peak value, corresponding to approximately three times the 
mass resolution. These candidates are combined with low-energy pions to form candidate 
D*+ decays, which are subjected to a kinematic and vertex fit to the decay chain. 

Gandidates are then required to have a mass difference Am = m{D^7i~^) — m{D^) within 
2MeV/c^ of the known value, corresponding to approximately 2.5 times the observed 
resolntion. The mnon candidate is required to be consistent with a muon signatnre in 
the detector, to have momentum 3 < p < lOOGeV/c, to be signihcantly separated from 
the primary vertex, and to form a good vertex with the candidate. The 
combinations are reqnired to have an invariant mass less than 5280MeV/c^ and their 
momentnm vector must point approximately to one of the reconstrncted PV locations, 
which removes combinatoric candidates while preserving a large fraction of semileptonic 
decays. In addition to the signal candidates, two independent samples of “wrong sign” 
candidates, and D^7T~p~, are formed for estimating the combinatorial background. 
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The former represents random combinations of D*^ candidates with muons from unrelated 
decays, and the latter is used to model the contribution of misreconstructed D*^ decays. 
Mass regions 5280 < m{D*^jj,~) < 10000MeV/c^ and 139 < Am < 160MeV/c^ are 
included in all samples for study of the combinatorial backgrounds. Finally, a sample of 
candidates is selected where the track paired with the D*^ fails all muon identification 
requirements. These candidates are used to model the background from hadrons 

misidentified as muons. 

To suppress the contributions of partially reconstructed B decays, including B decays 
to pairs of charmed hadrons, and semileptonic B —)■ decays with n > 1 

additional pions, the candidates are required to be isolated from additional tracks 

in the event. An algorithm is developed and trained to determine whether a given track is 
likely to have originated from the signal B candidate or from the rest of the event based on 
a multivariate analysis (MVA) method. For each track in the event, the algorithm employs 
information on the track separation from the PV, the track separation from the decay 
vertex, the angle between the track and the candidate momentum vector, the decay length 
significance of the decay vertex under the hypothesis that the track does not originate 
from the candidate and the change in this significance under the hypothesis that it does. 
A signal sample, enriched in B^ —)■ and B^ —)■ D*^^~T'^ decays, is constructed 

by requiring that no tracks in the event reach a threshold in the MVA output. In addition, 
the output is used to select three control samples enriched in partially reconstructed B 
decays of interest for background studies by requiring that only one or two tracks be 
selected by the MVA jj,~ ti~ or D *^or that at least one track selected by 
the MVA passes identification requirements {D *^These samples are depleted 
of B^ —)■ D*^jjTVfj, and B^ —)■ D*^T~Vr decays and are used to study and constrain the 
shapes of remaining backgrounds in the signal sample. 

The efficiencies £s and for the signal and the normalization channels, respectively, 
are determined in simulation. These include the effects of the trigger, event reconstruction, 
event selection, particle identification procedure, isolation method, and the detector 
acceptance. To account for the effect of differing detector occupancy distributions between 
simulation and data, the simulated samples are reweighted to match the occupancy 
observed in data. The overall efficiency ratio is Ss/Sii = (77.6 ± 1.4)%, with the deviation 
from unity primarily due to the particle identification, which dominantly removes low-px 
muon candidates, and vertex quality requirements. 

The separation of the signal from the normalization channel, as well as from background 
processes, is achieved by exploiting the distinct kinematic distributions that characterize 
the various decay modes, resulting from the /i — r mass difference and the presence of extra 
neutrinos from the decay t~ — )■ The most discriminating kinematic variables 

are the following quantities, computed in the B rest frame; the muon energy, the 
missing mass squared, defined as and the squared four-momentum 

transfer to the lepton system, = {p^ —PdYj where and p)) are the four-momenta 

of the B meson, the D*~^ meson and the muon. The determination of the rest-frame 
variables requires knowledge of the B candidate momentum vector in the laboratory frame, 
which is estimated from the measured parameters of the reconstructed final-state particles. 
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The B momentum direction is determined from the unit vector to the B decay vertex 
from the associated PV. The component of the B momentum along the beam axis is 
approximated using the relation {j)b)z = ;y^(Preco)^, where is the known B mass, 
and rureco and preco are the mass and momentum of the system of reconstructed particles. 
The rest-frame variables described above are then calculated using the resulting estimated 
B four-momentum and the measured four-momenta of the ii~ and D*^ . The rest-frame 
variables are shown in simulation studies to have sufficient resolution 15%-20% full 
width at half maximum) to preserve the discriminating features of the original distributions. 

Simulated events are used to derive kinematic distributions from signal and B back¬ 
grounds that are used to ht the data. The hadronic transition-matrix elements for 
B^ —)■ and B^ —)■ 

from heavy quark effective theory 
parameters are taken from Ref. 


V, 


26 
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decays are described using form factors derived 
Recent world averages for the corresponding 
These values, along with their correlations and 
uncertainties, are included as external constraints on the respective fit parameters. The 

decays include a helicity-supressed 
This parameter is not 




T Vr 


Vn decays 


28 


hadronic matrix elements describing B^ ^ D 
component, which is negligible in B^ — 
well-constrained by data; hence, the central value and uncertainty from the sum rule 
presented in Ref. are used as a constraint. It is assumed that the kinematic properties 
of the B^ —)■ decay are not modified by any SM extensions. 

For the background semileptonic decays B —)■ (2420), ^ 2 (2460), (2430))/i“T 
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(collectively referred to as R —)■ R**(—)■ form factors are taken from Ref. 

The slope of the Isgur-Wise function 30,is included as a free parameter in the 
ht, with a constraint derived from htting the control sample. This ht also 

serves to validate this choice of model for this background. Contributions from R° —)• 
(£>^■1(2536), 0*^2 (2573))/i“F^ decays use a similar parameterization, keeping only the lowest- 
order terms. Semileptonic decays to heavier charmed hadrons decaying as D** — D*~^7nr 
and semitauonic decays B —)■ (^^(2420), 772(2460), R((2430))r“z/.r are modeled using the 
ISGW2 parameterization. To improve the modeling for the former, a ht is performed to 
the D*~^fi~7r~^7T~ control sample to generate an empirical correction to the distribution, 
as the resonances that contribute to this hnal state and their respective form factors are 
not known. The contribution of semimuonic decays to excited charm states amounts to 
approximately 12% of the normalization mode in the ht to the signal sample. 

An important background source is B decays into hnal states containing two charmed 
hadrons, B —)■ D*^HcX, followed by semileptonic decay of the charmed hadron —)■ 

This process occurs at a total rate of 6%-8% relative to the normalization mode. 
The template for this process is generated using a simulated event sample of B'^ and 
B^ decays, with an appropriate admixture of hnal states. Corrections to the simulated 
template are obtained by htting the control sample. A similar simulated sample 

is also used to generate kinematic distributions for hnal states containing a tertiary muon 
from B — D*^D~X decays, with D~ —)■ T~h'r and t~ —)■ 

The kinematic distributions of hadrons misidentihed as muons are derived based on the 
sample of D*^h^ candidates. Control samples of D*~^ (A) decays are used to determine 
the probabilities for a vr or 7F (p) to be misidentihed as a muon, and to generate a 3 x 3 
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matrix of probabilities for each species to satisfy the criteria for identihcation as a, 7r,K or 
p. These are used to determine the composition of the D*~^h^ sample in order to model the 
background from hadrons misidentified as muons. Two methods are developed to handle 
the unfolding of the individual contributions of vr, K, and p, which result in different 
values for TZ{D*). The average of the two methods is taken as the nominal central value, 
and half the difference is assigned as a systematic uncertainty. 

Combinatorial backgrounds are classified based on whether or not a genuine D*^ —)• 
decay is present. Wrong-sign D^tt~ pT combinations are used to determine the 
component with misreconstructed or false D*^ candidates. The size of this contribution is 
constrained by htting the Am distribution of candidates in the full Am region. The 

contribution from correctly reconstructed D*^ candidates combined with fi~ from unre¬ 
lated b hadron decays is determined from wrong-sign combinations. The size of this 

contribution is constrained by use of the mass region 5280 < m{D*^< 10000 MeV/fy, 
which determines the expected ratio of D*^pT to D*^pi^ yields. In both cases, the contri¬ 
butions of misidentihed muons are subtracted when generating the kinematic distributions 
for the fit. 

The binned E*, and distributions in data are ht using a maximum likelihood 

method with three dimensional templates representing the signal, the normalization and 
the background sources. To avoid bias, the procedure is developed and finalized without 
knowledge of the resulting value of TZ{D*). The templates extend over the kinematic 
region —2 < mT^^ < lOGeVfyfy in 40 bins, 100 < E*^ < 2500 MeV in 30 bins, and 
-0.4 < g2 < 12.6GeV7fy in 4 bins. The fit extracts: the relative contributions of signal 
and normalization modes and their form factors; the relative yields of each of the B —>■ 
!!**(—)■ D*^n)piii and their form factors; the relative yields of 5° —)■ D*^Kl)pt,~V^ 

and B — D*^Tn:)pL~V decays; the relative yield of 5 —)■ D*^Hc{-^ piuX')X decays; 
the yield of misreconstructed D*^ and combinatorial backgrounds; and the background 
yield from hadrons misidentihed as muons separately above and below \p^\ = 10 GeV. 
Uncertainties in the shapes of the templates due to the hnite number of simulated events, 
which are therefore uncorrelated bin-to-bin, are incorporated directly into the likelihood 
using the Beeston-Barlow ‘lite’ procedure [^. The ht includes shape uncertainties with 
bin-to-bin correlations {e.g. form factor uncertainties) via interpolation between nominal 
and alternative histograms. Gontrol samples for partially reconstructed backgrounds {i.e. 

D*p~7r~^n~, and D* pT K^) are ht independently from the ht to the signal 
sample. Since the selections used for these control samples include inverting the isolation 
requirement used to select the signal sample, this method allows for the determination of 
the corrections to the B — pivX')X and B —)■ D*~^7nTpi~T/n backgrounds with 
negligible inhuence from the signal and normalization events. The results are validated 
with an independently-developed alternative ht. In this second approach, control samples 
are ht simultaneously with the signal sample with correction parameters allowed to vary, 
allowing correlations among parameters to be incorporated exactly. This ht also forgoes 
the use of interpolation in favor of reweighting the simulated samples and recomputing 
the kinematic distributions for each value of the corresponding parameters. The two hts 
are extensively cross-checked and give consistent results. 
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Figure 1: Distributions of (left) and E* (right) of the four bins of the signal data, 

overlaid with projections of the fit model with all normalization and shape parameters at their 
best-fit values. Below each panel differences between the data and fit are shown, normalized by 
the Poisson uncertainty in the data. The bands give the In template uncertainties. 
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The results of the ht to the signal sample are shown in Fig. Values of the 
form factor parameters determined by the ht agree with the current 


^ D 


*+ ,,-Tl 




world average values. The ht hnds 363 000 ± 1600 D*^fi z/^ decays in the signal 

sample and an uncorrected ratio of yields N{B^ —)■ D*^t~Vt-)/N{B^ —= 


(4.54 ± 0.46) X 10 Accounting for the r 


-)■ /i z/^z/^ 


branching fraction 25 and the 


ratio of efficiencies results in TZ{D*) = 0.336 ± 0.034, where the nncertainty inclndes the 
statistical nncertainty, the nncertainty dne to form factors, and the statistical nncertainty 
in the kinematic distributions used in the ht. As the signal yield is large, this uncertainty is 
dominated by the determination of various background yields in the ht and their correlations 
with the signal, which are as large as —0.68 in the case of —?■ ^vX')X. 

Systematic uncertainties on TZ{D*) are summarized in Table The uncertainty 
in extracting TZ{D*) from the ht (model uncertainty) is dominated by the statistical 
nncertainty of the simnlated samples; this contribntion is estimated via the redaction in 
the ht uncertainty when the sample statistical uncertainty is not considered in the likelihood. 
The systematic uncertainty from the kinematic shapes of the background from hadrons 
misidentihed as muons is taken to be half the difference in TZ{D*) using the two unfolding 
methods. Form factor parameters are inclnded in the likelihood as nnisance parameters, 
and represent a source of systematic uncertainty. The total uncertainty on TZ{D*) estimated 


Table 1: Systematic uncertainties in the extraction of TZ{D*). 


Model uncertainties 

Absolnte size (xlO ^) 

Simnlated sample size 

2.0 

Misidentihed /i template shape 

1.6 

B^ — > T)*+(r“//i“)z7 form factors 

0.6 

B —> iivX')X shape corrections 

0.5 

B{B D**T-Vr)/B{B 

0.5 

B —)■ D**{^ D*7r7r)/iz/ shape corrections 

0.4 

Corrections to simnlation 

0.4 

Combinatorial backgronnd shape 

0.3 

B —)■ T)**(—)■ D*~^7i)fi~17n form factors 

0.3 

B —)■ D*~^{Ds —)■ Tu)X fraction 

0.1 

Total model nncertainty 

2.8 

Normalization uncertainties 

Absolute size (xlO“^) 

Simulated sample size 

0.6 

Hardware trigger efficiency 

0.6 

Particle identihcation efficiencies 

0.3 

Form-factors 

0.2 

B{t- /i“z7^z/r) 

< 0.1 

Total normalization nncertainty 

0.9 

Total systematic uncertainty 

3.0 
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from the fit therefore incorporates these sources. To separate the statistical uncertainty 
and the contribution of the form factor uncertainty, the ht is repeated with form factor 
parameters hxed to their best-ht values, and the reduction in uncertainty is used to 
determine the contribution from the form factor uncertainties. The systematic uncertainty 
from empirical corrections to the kinematic distributions of 5 —)■ D**{^ 
and B —)■ ^vX')X backgrounds is also computed based on hxing the relevant 

parameters to their best ht values, as described above. The contribution of S —?■ 

B —)■ /}**(—)■ D*^'K'K)T~Vr and 5° —)■ (Zi)ji^(2536), Zi)j2(2573))r“T,- events is 
hxed to 12% of the corresponding semimuonic modes, with half of this yield assigned as a 
systematic uncertainty on TZ{D*). Similarly the contribution of 5 —)■ D*~^Dj t~Vt) 
decays is hxed using known branching fractions (^, and 30% changes in the nominal 
value are taken as a systematic uncertainty. Corrections to the modeling of variables 
related to the pointing of the candidates to the PV are needed to derive the kinematic 
distributions for the ht. These corrections are derived from a comparison of simulated 
B^ —!■ D*^^~T'^ events with a pure B^ —data sample, and a systematic 
uncertainty is assigned by computing an alternative set of corrections using a different 
selection for this data subsample. 

The expected yield of candidates compared to candidates (used to 

model the combinatorial background) varies as a function of m{D*^fB"). The size of this 
effect is estimated in the 5280 < m{D*^fB) < 10000 MeV/c^ region and the uncertainty is 
propagated as a systematic uncertainty on 1Z{D*). 

Uncertainties in converting the htted ratio of signal and normalization yields into TZ{D*) 
(normalization uncertainties) come from the hnite statistical precision of the simulated 
samples used to determine the efficiency ratio, and several other sources. The efficiency 
of the hardware triggers obtained in simulation differs between magnet polarities and 
between Pythia versions — the midpoint of the predictions is taken as the nominal value 
and the range of variation is taken as a systematic uncertainty on the efficiency ratio. 
Particle identihcation efficiencies are applied to simulation based on binned J/V’ —)• iB 
and —)■ K~7i^ control samples, which introduces a systematic uncertainty that is 

estimated by binning the control samples differently and by comparing to simulated 
particle identification. The signal and normalization form factors alter the expected ratio 
of detector acceptances, and la variations in these with respect to the world averages are 
used to to assign a systematic uncertainty. Finally, the uncertainty in the current world 
average value of i3(r“ —)■ /i^z/^z/,-) contributes a small normalization uncertainty. 

In conclusion, the ratio of branching fractions TZ{D*) = B{B^ — /B{B^ — 
is measured to be 0.336 ± 0.027 (stat) ± 0.030 (syst). The measured value 
is in good agreement with previous measurements at BaBar and Belle and is 2.1 
standard deviations greater than the SM expectation of 0.252 ± 0.003 [^. This is the first 
measurement of any decay of a 6 hadron into a hnal state with tau leptons at a hadron 
collider, and the techniques demonstrated in this letter open the possibility to study a 
broad range of similar b hadron decay modes with multiple missing particles in hadron 
collisions in the future. 
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